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I . Ductal carcinoma in situ (DCIS) lesions are non-invasive tumours of the breast which are thought to 

QQ ' precede most invasive breast cancers (IBC). As individual DCIS lesions are initiated, grow and invade 

w-s I (i.e. become IBC) the size distribution of the DCIS lesions present in a given human population will 

evolve. We derive a differential equation governing this evolution and show, for given assumptions 

about growth and invasion, that there is a unique distribution which does not vary with time. Further, 

we show that any initial distribution converges to this stationary distribution exponentially quickly. It 

is therefore reasonable to assume that the stationary distribution governs the size of DCIS lesions in 

human populations which are relatively stable with respect to the determinants of breast cancer. Based on 

this assumption and the size data of 110 DCIS lesions detected in a mammographic screening program 

>v>( ' between 1993 and 2000, we produce maximum likelihood estimates for certain growth and invasion 

j_j I parameters. Assuming that DCIS size is proportional to a positive power p of the time since tumour 

C^ . initiation, we estimate/7 to be 0.50 witha95% confidence interval of (0.35,0.71). Therefore we estimate 

that DCIS lesions follow a square-root growth law and hence that they grow rapidly when small and 

relatively slowly when large. Our approach and results should be useful for other mathematical studies 

of cancer, especially those investigating biological mechanisms of invasion. 

Keywords: Breast cancer, cancer progression, DCIS, population dynamics, maximum likelihood 



1. Introduction 



Most breast cancers arise in th e epithelial cells which line the miUc ducts of each breast JErbas et al. 



(120061) : ISilverstein et al.\ (l2006h ). While a tumour remains confined to the system of ducts it is known 



as a ductal carcinoma in situ (DCIS) of the breast. If the tumour penetrates the epithelial membrane 
which bounds the ductal system then it is termed an invasive breast cancer (IBC). In this paper we will 
use the term tumour to mean either DCIS or IBC and we use DCIS lesion or simply lesion as a synonym 
for DCIS. IBCs are considered to be life-threatening whereas DCIS lesions per se are not, since the 
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epithelial membrane prevents DCIS lesions from spreading to critical organs. 

The transition from DCIS to IBC is known as invasion. It is not known what biological events trigger 
invasion or what proportion of DCIS lesions would invade if left untreated because essentially all DCIS 
lesions are surgically removed once discovered. However, as this surgery can be psychologically and 
physically damaging and could potentially spread cancerous cells, determining the risk associated with 
untreated DCIS is of great clinical importance. 

Currently it is not ethically possible to directly observe the natural history of DCIS (though see 
Welch et al.\ (12008 1)) so our knowledge comes from indirect sources: follow-up studies of DCIS initially 
misdiagnosed as benign; studies of recurrence of DCIS as invasi ve cancer; autopsy studie s; studies of 



risk factors for DCIS; animal studies; and mathematical studies (lErbas et al.\ ilOOai ). As Erbas et al. 



(120061) point out, all of these studies have weaknesses for estimating the proportion of DCIS lesions 
which invade, e.g. lesions observed in follow-up studies are likely to be of low histological grade and 
consequently less prone to invasion than other DCIS lesions. In this context, a mathematical study of 
DCIS, despite its relia nce on simpHfying assumptions, may be able to make an important contribution 
to the ongoing debate (IWelch et al.\ (l2008h ') about the clinical significance of DCIS. 

Previous mathematical studies of breast canc er have generally f ocussed on IBC. Some of these 
studies do not incorporate any DCIS phase (Atkinson et al.\ (Il983h ; iHart et al.\ (119981) ) while others 
include a DCIS phase only a s a period of latency between tumour initiation and th e start of invasive 
growth (iKopans et a/.l(l2003b ; lMoolgavkar & L uebeck'('2003');'Yakovlev et al' (l996)). By contrast, the 
present paper is chiefly concerned with DCIS. Franks et al. (2005) also explic itly study DCIS though 
their approach is very different to the one presented here. IDewanji et al\ (119891) use a general two-stage 
model of cancer growth and progression to derive interesting results which are applicable to DCIS, 
though their approach is also very different from the one presented here, being based on stochastic 
processes rather than differential equations and with age rather than calendar time as the time variable. 

DCIS size is defined as the maximal linear extent of the lesion, or its diameter. While volume is 
often used t o describe the size of I BCs, the linear or branched shape of DCIS makes this definition 
impractical (iSilverstein et al.\ (120061) ). 



The rest of this paper i s set out as follows. In S|2]we use the results of Diekma nn et al.\ (119841) and 
the example of i Hart et al.\ (11998) to give an equation governing the time-evolution of the DCIS size 
distribution of a given human population as individual lesions grow and invade. For given assumptions 
about growth and invasion there is a unique stationary distribution which does not vary with time. Fur- 
ther, we prove (in ^ that any initial DCIS size distribution converges to this stationary distribution 
exponentially quickly. Therefore, for a human population which is relatively stable with respect to the 
determinants of breast cancer, it is reasonable to assume that the stationary distribution is close to the 
true DCIS size distribution. We use this assumption in fU^nd |j5]to generate maximum likelihood esti- 
mates for certain DCIS growth and invasion parameters, based on data from a mammographic screening 
program. In i]6]we draw some conclusions and finish with some ideas for future research. 



2. The stationary distribution of DCIS size 

In this section we find an equation governing the evolution of the size distribution of DCIS lesions 
present in a human population as individual lesions are initiated, grow and invade. For given growth and 
invasion rates there is a unique stationary distribution which does not vary with time a nd we give an ex 



pressio n for this distribution. As men tioned above, this section is based on the results of lDiekmann et al. 
(Il984 and motivated by the ideas of lHart et all (Il998b . 
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Diekmann et al.\ ( 11984 ') derive an equation governing the time-evolution of the size distribution of a 
population of cells which grow, die and reproduce by binary fission. Setting the fission rate to zero gives 
an equation describing the size distribution of a population of individuals which grow and stochastically 
leave the population, so this applies to a population of DCIS lesions which grow and invade. Here we 
assume that DCIS lesions invade and grow independently of each other, even if they occur in the same 
woman. This assumption is only approximately true, but multiple (detectable) DCIS lesions per woman 
seem to be very rare so this assumption probably has a similar level of validity to the independence 
assumptions of most practical stat i stics. 

So following lDiekmann et g/.l (119841) . though using sUghtly different notation, we assume that indi- 
vidual DCIS lesions of size x grow and invade at rates f{x)/a and ji{x), respectively, where the growth 
factor a > is constant for each tumour Hence the size y{T) of a DCIS lesion at time T evolves 
according to the differential equation 






(2.1) 



and if X is a random variable giving the tumour size at invasion (conditional on a) then Pr{X > x) is 



G{x) — exp — 






d^ 



(2.2) 



(cf. (2.4) of Diekmann et al.\ (119841) '). The function / determines the DCIS growth law and we assume 
it is such that any solution y: [0, °o) — > K of (12. Il l with y{0) approximately the size of one epithelial cell 
is a strictly monotonically increasing function. For the rest of this section we treat the growth factor a 
as a constant but in later sections we will allow it to vary randomly from tumour to tumour 

Now, let (j){x,t) be an (unnormalized) density function for the DCIS lesions in the population, i.e. 
suppose that (j) is such that Jq (j){£,,t)d£, is the expected num ber of lesions at time f with size in the 
range [0 , x] , for any x > and t . Then applying equation (2.1) of lDiekmann et al.\ (119841) to our situation 
(where the fission rate b is zero) gives 



d<j) 
'dt 



d_ 

dx 



a 



-Hip. 



(2.3) 



So far we have not accounted for the proc ess of tumour in i tiation . However a straight-forward 



modification of the argument in the Appendix of Diekmann et al.\ (119841) shows that if tumours of size x 



are initiated at a rate S{x), assumed constant in time, then (j) wiU evolve according to 



dip 
'dt 



d_ 
dx 



a 



-^lip + S. 



(2.4) 



Note that S{x) is the rate of initiation with respect to calendar time, not age, so our assumption that S{x) 
is independent of time is roughly equivalent to the assumption that the number of DCIS lesions which 
are initiated in the human population is the same from month to month. The source term S is unknown 
but since DCIS lesions are initially approximately the size of one epithelial cell we know that S{x) — 
for all jc > e where e > is microscopically small. 

Using ( I2.2I 1 to write jj. in terms of G we arrive at the following inhomogenous linear equation gov- 
erning the time-evolution of : 



^ /^ //_/g:a 



dt a di 
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Now, let \ir be the unnormalized stationary distribution, meaning that it satisfies (I2.5l l and is a func- 
tion of X only, i.e. dyz/dt ~ 0. Then for all macroscopic x, (I2.5l l becomes 

f f W' f G' 

= v/- -—- — + — 

^a\ w f G 

where the primes denote differentiation with respect to x. This has the unique solution 

V/(x)=c^ (2.6) 

where c is a constant. If c is chosen so that J^\i/{x)dx — 1 then we refer to i// as the (normaUzed) 
stationary distribution. 

3. Convergence to the stationary distribution 



In §2] we used the results of iDiekmann et al.\ (119841) to show that, for given growth and invasion rates, 



there is a unique DCIS size distri bution which does not vary with time. Unfortunately the convergence 



results of lDiekmann et al.\ (119841) cannot be directly applied to our situation of continuous tumour initia- 
tion and zero fission rates (cf. the assumptions H/, on page 229 of Diekmann ef a/. (1984) ). Therefore, in 
this section we prove that any distribution converges to the stationary distribution exponentially quickly 
(and with a reasonable time-scale). This result adds legitimacy to our use of the stationary distribution 
in [j5]to produce maximum likelihood estimates of DCIS growth and invasion parameters. However the 
proof is fairly technical and a reader who is prepared to accept this result can skip this section without 
compromising his or her understanding of the later sections . 

We now fix a value of a for the rest of this section and we use the notation of fJ2] throughout except 
that we write (p^ and (pt for the partial derivatives of (p. In this notation, ( 12.5b becomes 

We will show that for any function (p which satisfies ( 13. Il l, the sequence of functions (p{-,t) converges to 
the unnormalized stationary distribution i/a as f — ;• oo. The proof will be given for the model of invasion 
used in i|4]and !J5] where /x(x) = h is constant, but the proof extends to other models without major 
modifications. 

It follows from (13. Il l that 8^(p — \f/ satisfies the homogenous partial differential equation 

5r+(l-)5..+ (i^-i^)5 = 0. (3.2) 

\aj \a aG J 

Our aim is to show that 5(-,r) approaches as f ^oo. Since ( 13.21 ) is a linear, fir st-ord er partial differential 
equation we can solve this equation by the method of characteristics (e.g. see |johnl(ll978.) pages 8-14). 
The graph 

{(.x,f,z)e]R^|z = 5(.x,f)} 

of a solution 5 of ( 13.2b is known as an integral surface, and the images of the integral curves of the 
vector field 

m^z.mG'ix)_.m\ ^,,^ 

a aG{x) a J 
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are the characteristic curves of ( I3.2l i. The projection of a characteristic curve into the xf -plane is called 
a characteristic. Our interest in the characteristic curves derives from the fact that every integral surface 



is a union of characteristic curves (UohnldigTSb ). 

Now, let 5o : M>o — > R be given, where IR>o denotes the positive reals. From now on we let 5 
denote the solution of (13.2b which satisfies the initial conditions 5(^,0) — 8q{£,) for every ^ > 0. Let 
{x,t) e K>o X K. be given and assume for the rest of this section that y is the solution of ( 12.1b with 
y{t) = X. The characteristic which passes through the point (x,f ) is the graph of y, i.e. the characteristic 
is the trajectory of a tumour which is of size x at time t. If the characteristic does not meet the positive 
X-axis then the initial conditions above do not determine 5(x,f ). In this case we set 5{x,t) =0 since no 
tumour in existence at time has size x at time t and there is no source term in (13. 2t . On the other hand, 
if the characteristic through (x,f) meets the positive x-axis then the initial conditions determine 5{x,t) 
as follows. 

It is easily checked that any integral curve 7 of ( 13.3b is given by 

for some XQ,tQ,zo £ K where the function Y solves ( 12. lb with Y{tQ) = xq. However, since the corre- 
sponding characteristic curve (i.e. the image of 7) does not depend on to we may assume without loss of 
generality that to = 0. Suppose now that the image of 7 is the characteristic curve which passes through 
(x,f , 5(x,f)). This implies firstly that Y{t) — x and hence Y =y and secondly that 7 lies in the integral 
surface of 5, so from 7(0) = (xo,0,zo) we have zo — 5o(xo). Combining this with ( 13.4b gives 

5ix,t)^5oiIix,t))-^/-ili^ (3.5) 

G{I{x,t)) /(x) 

where we have written /(x,f) in the place of xq to make its dependence on x and f explicit. Here /(x,f ) 
is the initial size of a tumour which is of size x at time t, i.e. /(x,f) is the function of x and t which is 
determined by the condition I{x,t) — y{Q) where, as above, y solves ( 12.1b with y{t) — x. 

As in Sj4]we assume that the time between tumor initiation and invasion is a random variable T with 
constant hazard h, i.e. that Pr{T > t) — exp{—ht). Let X be the tumour size at invasion and recall that 
G{£, ) = Pr{X > £,)■ Then for a tumour which was initiated at time a, X — y{T + a) and so 

Gi^)=PriX > ^)^Pr{y{T + a) > ^) = Pr{T > y-\^) -a) ^ exp{-h{y-\^) -a)). 

This is consistent with ( 14.4b because a = in ijj] Therefore, regardless of the value of a. 



G{x) 
G{I{x,t)) 



-ht 



(3.6) 



Now, from ( 12.1b we have 



JO a Ji(xj)f[t,) 

where F is any function which satisfies F' = \ //. This equation holds for arbitrary x and t so taking 
partial derivatives with respect to x and rearranging gives 
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So by combining ( 13.51 ), ( 13.61 ) and ( I3.7l i we obtain 

|5(^,f)|^^=e-''' r|5o(l)Ml. (3.8) 





This shows that the functions 5{-,t) approach exponentially quickly with respect to the L^ norm 
as f — > oo. Therefore any DClS size distribution approaches the stationary distribution exponentially 
quickly as t ^>-oo. 

Since h is the hazard of invasion, the time-scale of convergence to the stationary distribution is 
comparable to the time for an individual DClS lesion to invade, perhaps of the order of 20 years. Hence 
the rate of convergence is comparable to the rate of population changes which affect the incidence 
of breast cancer such as changes to the distributions of: age at first birth; age at menarche; pre- and 
post-menopausal body-mass index; and exogenous hormone use. So in human populations where the 
determinants of breast cancer have changed slowly in recent years, we would expect the population 
DClS size distribution to track with the changes in the underlying stationary distribution without ever 
deviating too far from it. 

The above arguments are conditional on a, however since the rate of convergence in (13.8) does not 
depend on a, the same convergence results apply when we marginalize over a as in the next section. 

4. Parametric assumptions 

In Sj5]we will use the method of maximum likelihood to estimate certain important features of DCIS 
growth and invasion. As for most likelihood-based estimation procedures we must first make some 
parametric assumptions. In this section we describe these assumptions and give an explicit expression 
for the corresponding stationary distribution. 



Hart et al.\ (Il998h show that the invasive breast cancers cannot grow according to exponential, lo- 
gistic or Gompertz laws, but can grow according to a power law. Following this work, we assume that 
DCIS size is proportional to a positive power of the time since tumour initiation. This corresponds to 
the choice 

fiy)=kyP (4.1) 

in ( 12. Il l where ^ > and )3 < 1 are constants. Then (12. Il l has the solution 

y{t) = Ml -l3)/a+4-P]T^ (4.2) 



where xq is the initial size of the DCIS lesion (see equation (2) of iHart et al.\ il99st) ). Therefore the 



inverse of y is given by y ^{x) — [x^ ^ ^o ]oc/(^(l ~ P))- Assuming that xq is negligibly small 
compared to x (which is reasonable, since xq is approximately the size of one epithelial cell) we have 

Now, as in !J2] let X be a random variable representing tumour size at invasion conditional on the 
growth factor a and define G{x) = Pr{X > x). Let T be the time from tumour initiation to invasion 
conditional on a, i.e. T = y^^{X) where y^' is as given in (14.31 1. We model invasion by one of the 
simplest assumptions possible in our framework, namely that the hazard function for T is some constant 
h > 0, i.e. that 

Pr{t ^ T <t + At I f < r) =hAt + 0{At^), 
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SO Pr{T > t) = exp(— /if ). Combining this with G{x) = Pr{T > y ' [x)) gives 

G{x) = exp [— /!y"^(x)] . 
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(4.4) 



Substituting (14.3b into ( 14.4b then substituting the result into ( 12.6b gives an expression for the stationary 
distribution )//« conditional on a: 



V/a(x) ^CaX '^exp 






(4.5) 



where Ca does not depend on x. The integral /q°° y/a (x)(ix exists for /3 < 1 and substituting ( 14.5b into the 
equation /q°° \ifa{x)dx = 1 and effecting a simple change of variables shows 

Ca — ah/k. 



Now, as in lBrown et g/.l (11984ft we take the growth factor a to be exponentially distributed. Due to 
the presence of k in ( 14.1b we may assume without loss of generality that the expected value of a is 1 . 
Marginalising (14.5b over a gives the PDF of the (unconditional) stationary distribution; 



Va(x) 



e "'^a{x)da 



X ^hk ' / a exp 

/o 



a 



hx'-P 

k{l~p) 



da 



= x-^kh-^ 
where the last step uses integration by parts. 



kh 



-'+x'- 



(1-^)- 



-2 



(4.6) 



5. Comparison with mammography data 

In ij3]we showed that any DCIS size distribution approaches the stationary distribution exponentially fast 
(and on a relatively short time-scale). Therefore it seems reasonable to assume that the size distribution 
of DCIS lesions present in a real human population will approximate the stationary distribution corre- 
sponding to the processes of growth and invasion in operation. In this section we use this assumption, 
together with the parametric assumptions of iJH to obtain maximum likelihood estimates for the growth 
and invasion parameters introduced in fjH This estimation will be based on DCIS size data collected by 
a mammographic screening program operated by BreastScreen Victoria (BSV), in Victoria, Australia 
since 1993. 

5.1 The BSV dataset 

The BreastScreen Victoria (BSV) data was collected between 1993 and 2000. In the Australian state 
of Victoria, women are invited to attend free breast screen clinics every 2 years between the ages of 50 
and 69 years. We restricted our data to this age range and to DCIS lesions which were detected via a 
woman's first mammogram. The ages at first mammogram were approximately uniformly distributed 
across this age range. The size of a DCIS lesion was measured by a pathologist after it was detected via 
a mammogram and surgically removed. Most DCIS lesions are roughly linear in shape, and for such 
lesions, DCIS size is essentially the length of the lesion. 
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Since very few pre-invasive breast cancers are detected outside of mammographic screening pro- 
grams, the BSV dataset is an excellent source of data on larger DCIS lesions. However, detection of 
a small DCIS lesion via mammographic screening is uncertain, and probably depends on a number of 
factors such as the orientation of the lesion, its position in a den s e or n on-dense part of the breast and 
the presence or absence of micro-calcification (ISilverstein et al\ (l2006l) V Therefore, in order to avoid 
these complications, we restricted the BSV data to those lesions with size greater than or equal to 15mm 
(though see i]5.3b . We also excluded two outlying DCIS lesions with size greater than 100mm. This 
gave us a dataset of 110 DCIS lesions of size at least 15mm which were detected on a woman's first 
mammogram when she was between 50 and 69 (inclusive) years old. A histogram of these sizes is given 
in Figure [T] The 25%, 50% and 75% quartiles for these sizes were 18, 24, and 35mm, respectively, 
while the age of participants had a mean of 58 years and a range of 50 to 69 years. 




I 

60 
DCIS size (mm) 



Fig. 1. A histogram of DCIS size in the BSV dataset, with the fitted DCIS size distribution overiaid 



5.2 Maximum likelihood estimates 

We would now like to use the PDF ( 14.61 1 of the stationary distribution to obtain maximum likelihood 
estimates. However since we have restricted the BSV data to tumours with size greater than or equal to 
15mm we must first condition this PDF appropriately. A simple change of variables shows 



Jl5 



^f{x)dx 
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SO the appropriate PDF for the BSV data is 



1 



kh-^l-p) 



'"'-' 



[/t/i-i+xi-/^(l-j3)-i] 



Note that h and A; only appear in ( 15. Il l in the combination A;/; ' . Therefore it is not possible to pro- 
duce separate maximum likelihood estimates for the parameters h and k, only for the quantity O^kh^^. 
Therefore we define the following likelihood function 






L{ii,9)^Yl-;^ l---;; :' (5.2) 



where xi ,X2, . . . ,xi lo are the sizes (in millimeters) of the DCIS lesions in the BSV dataset. 

We re-parameterized the likelihood (15. 2t in terms of jS and log Q then calcula t ed m aximum likeli- 



hood estimates using the 'mle' function of R 2.11.1 dR Development Core TeamI (12006.) ). The corre- 
sponding estimates for j3 and Q are 

(J3,0) = (-0.99,30.9). 

The PDF ( 15.11 ) corresponding to these estimates is overlaid on the histogram of the BSV data in Figure 
[T] showing a good fit. 

We generated conservative 95% confidence intervals (95% CI) for j3 and by projecting the re- 
gion of the parameter space where the relative log-likelihood is greater than —5.99 onto the j3 and 
e co-ordinate axes (see §4.5 of .Severini. (2000,) ). This gave a 95% CI of (-1.70,-0.48) for j3 and 
(0.00, 1278) fore. 

5.3 Sensitivity analysis 

We varied the size cutoff, introduced because of the uncertain detection of small lesions, in order to 
assess its impact on our estimates. Conditioning on DCIS size greater than or equal to 10mm, 15mm or 
20mm gave approximately the same estimates for j3, as can be seen from Table [T] 

5.4 Interpretation of the results 

The confidence interval for d is so wide that we cannot draw any conclusions regarding the growth or 
invasion rates of DCIS lesions. However the estimate for )3 is more reliable. By (I4.2l i our estimate j3 = 
—0.99 corresponds, to a good approximation, to DCIS growth in which the lesion size is proportional 
to the square root of the time since tumour initiation. The bounds of the 95% CI place the exponent 



Table 1 . Results of a sensitivity analysis in which the size cutoff is varied 

Cutoff (mm) Number of DCIS lesions j3 (95% CI) (95% CI) 

-1.07 (-1.70, -0.47) 86(5.9,661) 
-0.99 (-1.86, -0.41) 31 (0.00, 1278) 
-1.06 (-2.13, -0.55) 0.41 (0.00, 4357) 
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between 0.35 and 0.71, corresponding to a class of curves with similar, very rapid initial growth which 
then slows with increasing time and DClS size. 

This may be biologically plausible for two reasons: (1) unlike large lesions, small lesions are un- 
constricted by the epithelial membrane and surrounding tissues which bound the milk-ducts; and (2) it 
may be that large lesi ons receive less nut rients per cell than small ones due to limited contact with the 
epithelial membrane JFranks et al\ (l2005b ). 

Even if the assumption of a power-law for growth is overly restrictive, our results still suggest that 
DCl S growth slows do wn as the lesion increases in size, in contrast to IBC which grows at an increasing 
rate dHart et al\ ( Il998l) ). 

A square-root growth function may also be the key to explaining a surprising feature of the BSV 
data. The size distribution for DClS lesions detected on a woman's first mammographic screening is 
remarkably similar to the size distribution for tumours detected on a subsequent screen (see Figure 
121). This is surprising because first-screen-detected tumours could have been growing for more than 20 
years while subsequent-screen-detected tumours can only have been present and of a detectable size for 
2 years (the usual time between screens). Our estimate that DCIS growth is initially very rapid and then 
relatively slow may help to explain this feature. Note that the estimates of ij5.2l were only based on the 
size data of DClS lesions detected on the first screen. 




T^ 



T^ 



20 40 60 80 

DCIS size (mm) at first screen 



Fig. 2. A quantile-quandle plot comparing the DCIS size distributions of lesions which are detected on a woman's first mammo- 
graphic screening with those which are detected on a subsequent screen. 
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6. Conclusions and future research 

Because the natural history of DCIS cannot ethically be observed, indirect studies of DCIS growth 
and invasion such as the present one are needed to resolve the controversies surrounding the clinical 
significance of DCIS . The main contribution of our paper to this debate is our estimate that DCIS lesions 
approximately obey a square-root growth function and that they grow rapidly when small and relatively 
slowly when large. However, we have also shown how the unobserved processes of DCIS growth and 
invasion affect the observed DCIS size distribution, and we hope that this approach will also be useful 
for future studies of DCIS. 



A limitation of our approach is that, unlike stochastic models like that of lDewanii et al.\ (Il989h . our 
model does not allow for the possibility of DCIS regression. In theory, regression seems possible, though 
we are not aware of any data to support this hypothesis so the notion of DCIS regression remains fairly 
controversial. Another limitation is that we have ignored the possibility that DCIS lesions might leave 
the population due to migration or death of the host, in addition to invasion. However, our estimates 
were based on women aged 50 to 69 years, who have reasonably low mortality rates, so we don't expect 
this to affect our estimates greatly. 

A more serious limitation is that, due to the form of the likelihood function, it is impossible to 
produce separate estimates for the rates of DCIS growth and invasion (see il5.2l i. However, it might be 
possible to overcome this limitation by incorporating information on DCIS lesions which are detected 
at a woman's second or later mammographic screening. Without such an extension, even with a larger 
dataset, it is probably not possible to use our approach to estimate the proportion P of DCIS lesions 
which will invade within a lifetime. This proportion is of great clinical importance because if it is very 
small then the curr ent treatment regime of mandatory removal may be causing more harm than good, as 
suggested recently (Welch et al. (2008)). In any case, we hope that our estimate of the DCIS growth law 
will be used in other mathematical studies of DCIS and that one of these will produce credible estimates 
fox P. 

Another possible use of our approach is as a test on competing theories of invasion. At present, the 
biological events which trigger invasion are not known, however different biological invasion mecha- 
nisms will generally give different forms for the distribution of DCIS size at invasion. For example, if 
it is assumed that invasion occurs when a DCIS lesion grows along a milk duct to a pre-existing 'hot 
spot' (which might be a weakness in the epithelial membrane or a mutant stromal cell) then rather than 
( 14.4b we would have G{x) — exp(— /zjc) for some constant /z > 0. It may therefore be possible to use the 
methods of this paper to test different biological theories of invasion by computing maximum likelihood 
estimates and comparing the parsimony of the different models using a measure such as the Bayesian 
information criterion. 

Acknowledgements 

We would like to acknowledge and thank BreastScreen Victoria for the provision of data and the US 
Department of Defense Breast Cancer Concept Award (DAMD 17-03-0687) which funded aspects of 
this work. Views and opinions of, and endorsements by the authors do not reflect those of BreastScreen 
Victoria, the US Army or the Department of Defense. 

References 

Atkinson EN, Bartoszynski R, Brown BW, Thompson JR (1983) On estimating the growth 
function of tumors. Math Biosciences, 67, 145-166. 



^ -h 



4 -k 



1 2 of [12] DOWTY, BYRNES & GERTIG 

Brown B W, Atkinson EN, Bartoszynski R, Thompson JR, Montague ED (1984) Estimation 
of human tumor growth rate from distribution of tumor sizes at detection, J Nat Cancer Inst 72, 31- 
38. 

Dewanji a, Venzon DJ, Moolgavkar SH (1989) A stochastic two-stage model for cancer risk 
assessment II The number and size of premalignant clones. Risk analysis 9, 179-187. 

DiEKMANN O, Heijmans HJAM, Thieme HR (1984) On the stabiUty of the cell size distribution, / 
Math Biology 19, 227-248. 

Erbas B, Provenzano E, Armes J, Gertig D (2006) The natural history of ductal carcinoma in 
situ of the breast: a review. Breast Cancer Res Treat 97, 135-144. 

Franks SJ, Byrne HM, Underwood JCE, Lewis CE (2005) Biological inferences from a math- 
ematical model of comedo ductal carcinoma in situ of the breast, J Theor Biol, 232, 523-543. 

Hart D, Shochat E, Agur Z (1998) The growth law of primary breast cancer as inferred from 
mammography screening trials data, Br J Cancer, 78, 382-387. 

John F (1978) Partial differential equations (third edition). Springer, New York. 

Kopans DB, Rafferty E, Georgian-Smith D, Yeh E, D' Alessandro H, Moore R, Hughes 
K, Halpern E (2003) A simple model of breast carcinoma growth may provide explanations for 
observations of apparently complex phenomena. Cancer, 97, 2951-2959. 

Moolgavkar SH, Luebeck EG (2003) Multistage carcinogenesis and the incidence of human can- 
cer. Genes, Chromosomes & Cancer, 38, 302-306. 

R Development Core Team (2010) R: A language and environment for statistical computing. R 
Foundation for Statistical Computing, Vienna, Austria. 

Severini TA (2000) Likelihood methods in statistics. Oxford University Press, Clarendon. 

SiLVERSTEiN MJ, MacDonald HR, Mabry HC, Moorthy SB (2006) Ductal Carcinoma In Situ. 
In: Winchester DJ, Winchester DP, Norton L, Hudis CA (eds.). Breast Cancer (second 
edition). B.C. Decker, Hamilton, Ontario. 

Welch HG, Woloshin S, Schwartz LM (2008) The Sea of Uncertainty Surrounding Ductal Car- 
cinoma In Situ - The Price of Screening Mammography, J Nat Cancer Inst, 100, 228-229. 

Yakovlev ay, Hanin LG, Rachev ST, Tsodikov AD (1996) A distribution of tumor size at 
detection and its limiting form, P roc Nat Acad Sci, 93, 6671-6675. 



-V -h 



